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TRAP-Tagging: a novel method for the identification and purification of 

RNA-protein complexes 

Field of Invention 

This invention relates to a method for the identification and purification of RNA-protein 
complexes formed in vivo and in vitro. 

Background 

In addition to serving as essential intermediates between genes and proteins, RNA molecules 
also serve structural and regulatory roles in a rapidly growing list of biological processes. 
These include all of the basic steps of transcription initiation, splicing, localization,, 
translation and stabiUty (Dreyfiiss, et al., 2002; Szymanski et al., 2003; Doudna and Rath, 
2002; Erdmann et al., 2001; Pesole et al., 2001; Berkhout et al., 1989) as well as processes 
such as dosage compensation (Bell et al., 1988; Lee and Jaenisch, 1997; Meller et al., 2000; 
Salido et al., 1992), heterochromatin formation (Lee et al., 1997) and, telonaere maintenance 
(Le et al., 2000). Importantly, the genomes of many viruses are encoded as RNA rather than 
DNA, and much of their infective cycles are controlled by RNA biochemistry (Berkhout et 
al., 1989), as are the host defense systems that block the infection process (Mahalingam et al., 
2002). Clearly, these molecules and processes are crucial for cell and pathogen viabiUty, and 
are excellent targets for drug intervention. 

The recently coined term ribonomics has been defined as a complete understanding of 
loRNA metabolism, structure, interactions and function (Kleene 2001; Tenenbaum et al., 
2000), A comprehensive cataloguing of all ribonucleqprotem (RNP) complexes is an 
essential aspect of this major endeavor. However, the methodologies currently employed to 
identify RNA associated molecules are not ideally suited for such an endeavor. For example, 
RNA binding proteins generally do not have the same specificity as DNA buiding proteins. 
Consequently, techniques that identify individual RNA-protem interactions firequently isolate 
proteins that are irrelevant to the processes being studied. Indeed, there is increasing evidence 
that many high affinity RNA/protein interactions require multiple contacts between cis-acting 
elements and several proteins within a complex (Chartrand et al., 2001). This complexity has 
several deleterious effects for the detection of interactions in vitro. First, if individual 
interactions are weak, they may not occur in vitro. Second, if pre-formed multimeric 
complexes are stable, individual componraits may not be available for de novo assembly. 
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This would lead to the isolation of other more available and abundant molecules that are 
irrelevant to the process being investigated, 

A method capable of isolating specific KNA-protein complexes that form in vivo would 
circumvent many of the above-listed problems. If a similar method could be used to analyze 
complexes in vitro, the similarity between the results would indicate whether or not the in 
vivo process was required to study the RNA-protein complex in question- 
Summary 

The invention provides a method for purifying an RNA-protein complex formed in vitro 
comprising providing an RNA fusion molecule comprising a target RNA sequence and at 
least two different RNA tags, wherein at least one RNA tag interacts with a ligand in a 
reversible manner; contacting the RNA fusion molecule with a cellular extract; providing 
conditions that allow the formation of an RNA-protein complex on the target RNA sequence; 
and subjecting the RNA-protein complex to at least two different affinity purification steps, 
each step comprising binding one RNA tag to an aflSnity resin capable of selectively binding 
one RNA tag and elutmg flie RNA tag from the affinity resin after substances not bound to 
the affinity resin have been removed. In one embodiment the RNA fusion molecule is 
contacted with a protein mixture in place of a cellular extract. 

The invention also provides for a method for purifying an RNA-protein complex formed in 
vivo comprising: expressing in a eukaryotic cell an RNA fusion molecule comprising a target 
RNA sequence and at least two different RNA tags, wherein at least one RNA tag interacts 
with a ligand in a reversible manner; providing conditions that allow the formation of an 
RNA-protein complex on the target RNA sequence; generating a cellular extract; subjecting 
the cellular extract to at least two different affinity purification steps, each step comprising 
binding one RNA tag to an affinity resin capable of selectively binding one RNA tag and 
eluting the RNA tag from the affinity resin after substances not bound to the affinity resin 
have been removed. 

The invention also provides for a protein identified by isolating an RNA-protein complex 
formed in vitro or in vivo according to the methods of the current invention. 
In one embodiment, at least one RNA tag binds to an aflfinity resin through a fiision protein 
comprising a polypeptide that binds specifically to the RNA tag and a polypeptide that binds 
specifically to the affinity resin. In a preferred embodiment, the polypeptide that binds 
specifically to the affinity resin is selected from the group consisting of a maltose binding 
protein, a 6-histidine peptide, glutathione S transferase and a portion thereof sufficient to bind 
specifically to the affinity resin* 
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Another aspect of the invention is an RNA fusion molecule comprising a target RNA 
sequence and at least two different RNA tags, wherein at least one RNA tag interacts with a 
ligand in a reversible fashion. 

In one embodiment of the present invention at least one of the RNA tags is repeated. la a 
preferred embodiment the RNA tags are selected from streptavidin binding sequence (SI), an 
MS2 coat protein binding sequence, a streptomycin binding sequence (Strq)totag), a 
sephadex binding sequence p8), an N protein binding sequence (nut), a REV binding 
sequence, a TAT-binding sequence and an R17 coat protein bindiug sequence. In yet another 
preferred embodiment the RNA tags are at least one MS2 coat protein binding sequence and 
at least one streptavidin binding sequence. Li a most preferred embodiment the RNA tags are 
six MS2 coat protein binding sequences and two streptavidin binding sequences. 
In another embodiment of the current invention, the RNA fusion molecules further comprise 
at least one insulator sequence. 

The invention also provides for isolated DNA constructs encoding the RNA fusion molecules 
of the present invention and for vectors and host cells expressing the isolated DNA 
constructs. 

The invention relates to a method for screening test compoimds or proteins for their ability to 
modulate or regulate an RNA-protein complex by performing the methods of the present 
invention for purifying RNA-protein complexes formed in vitro or in vivo and observing a 
difference, if any, between the RNA-protein complexes purified in the presence of the test 
compounds or proteins and the absence of the test compounds or proteins, wherein a 
difference indicates that the test compounds or proteins modiilate the RNA-protein complex. 
This invention provides an isolated DNA construct comprising a transcription cassette, which 
comprises a promoter sequence, a bait sequence operably linked to the promoter, a 
transcriptional termination sequence which comprises a stop signal for RNA polymerase and 
a polyadenylation signal for polyadenylase, and at least two tag sequences. 
In one embodiment the isolated DNA construct comprises at least one streptavidin binding 
sequence [SEQ ID NO:l SEQ ID NO:2 SEQ NO 17] and at least one MS2 coat protein 
binding sequence [SEQ ID NO:4, SEQ ID NO:6 SEQ ID NO:7 SEQ NO 18]. In yet another 
embodiment, the isolated DNA construct comprises at least one tag sequence which 
hybridizes to the streptavidin binding sequence [SEQ ID NO:2] and at least one tag sequence 
which hybridizes to the MS2 coat protein sequence [SEQ ID NO:4] under high stringency 
hybridization conditions. 
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The invention also provides an isolated DNA constnict comprising a transcription cassette, 
which construct comprises, a promoter sequence, a bait sequence operably linked to the 
promote, a transcriptional termination sequence, which comprises a stop signal for RNA 
polymerase and a polyadenylation signal for polyadenylase; and at least three tag sequences. 
In another embodiment the isolated DNA construct comprises at least one streptavidin 
binding sequence [SEQ ID NO:2 SEQ NO 17] and at least two MS2 coat protein binding 
sequences [SEQ ID NO:7 SEQ NO 18]. In yet another embodiment the isolated DNA 
construct at least one tag sequence which hybridizes to the streptavidin binding sequence 
[SEQ ID NO:2 SEQ NO 17] and at least two tag sequences which hybridize to the MS2 coat 
protein sequence [SEQ ID N0:7 SEQ NO 18] under high stringency hybridization 
conditions. 

In one embodiment, the isolated DNA constructs further comprise at least three insulator 
sequences, and in another embodiment at least four insulator sequences. 
The present invention relates to expression vectors and host cells comprising the isolated 
DNA constructs. 

Another aspect of tiie invention is an RNA fusion molecule comprising a target RNA 
sequence and at least two RNA tags, wherein at least one of the RNA tags interacts with a 
ligand in a reversible fashion. In one embodiment the RNA fusion molecule comprises at 
least one streptavidin binding tag [SEQ ID NO:3] and at least one MS2 coat protein binding 
tag [SEQIDNO:5]. 

The current invention also relates to an RNA fusion molecule comprising a target RNA 
sequence and at least three RNA tags, wherein at least two of the RNA tags interact with a 
Hgand in a reversible fashion. In another embodiment, the RNA fusion molecule comprises 
at least one streptavidin binding tag [SEQ ID NO:3] and at least two MS2 coat protein 
binding tags [SEQ ID NO:8]. 

In one embodiment, the RNA fusion moleciiles further comprise at least 3 insulators, and in 
another embodiment, 4 insulators. 

The invention provides a method for isolating an RNA-protem complex formed in vivo 
comprising, expressing in a eukaryotic cell an RNA fusion molecule of the current invention, 
generating a whole cell extract, passing the extract over a first solid support comprising 
strq>tavidin protein, eluting a first eluate witih the addition of biotin, collecting the first 
eluate, passing the first eluate over a second solid support comprising MS2 coat protein, 
eluting a second elute with the addition of a reagent selected firom the group consisting of 
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glutathione, RNAse or a denaturant, and collecting the second elute, wherein the second . 
eluate contains the isolated RNA-protein complex. 

The current invention provides a method of identifying a protein in an RNA-protein complex 
comprising isolating an RNA-protein complex formed in vivo comprising, expressing in a 
eukaryotic cell an RNA fusion molecule of the current invention, generating a whole cell 
extract, passing the extract over a first solid support comprising streptavidin protein, eluting a 
first eluate with the addition of biotin, collecting the first eluate, passing the first eluate over a 
second solid support comprising MS2 coat protein, eluting a second elute with the addition of 
a reagent selected from the group consisting of glutathione, RNAse or a denaturant, and 
collecting the second elute, wherein the second eluate contains the isolated RNA-protein 
complex and identifying the protein in the RNA-protein complex. 

The invention also provides for a protein identified by performing the methods of isolating an 
RNA-protein complex formed in vivo. 

Another aspect of the current invention is a method for isolating an RNA-protein complex 
formed in vitro comprising, (a) expressing a RNA fusion molecule of the current invention in 
vitro, (b) obtaining a whole cell extract, (c) passing the whole cell extract over a first solid 
support comprising streptavidin protein, (d) eluting a first eluate with the addition of biotm, 
(e) collecting the first eluate, (f) passing the first eluate oyer a second solid support 
comprising MS2 coat protein, (g) eluting a second elute with the addition of a reagent 
selected fix)m the group consistmg of glutathione, RNAse or a denaturant, and (h) collecting 
the second eluate, wherein the second eluate contains the isolated RNA-protein complex. In 
one embodiment steps (c) to (e) are repeated. 

The current invention provides a method of identifying a protein in an RNA-protein complex 
conq)rising isolating an RNA-protein complex formed in vitro comprising (a) expressing a 
RNA fusion molecule of the current invention in vitro, (b) obtaining a whole cell extract, (c) 
passing the whole cell extract over a first solid support comprising streptavidin protein, (d) 
eluting a first eluate with the addition of biotin, (e) collecting the first eluate, (f) passing the 
first eluate over a second solid support comprising MS2 coat protein, (g) eluting a second 
elute with the addition of a reagent selected firom the group consisting of glutathione, RNAse 
or a denaturant, and (h) collecting the second eluate, wherein the second eluate contains the 
isolated RNA-protein complex and identijfying the protein in the RNA-protein complex. In 
one embodiment, steps (c) to (e) are repeated. 

The invention also provides for a protein identified by the methods of isolating an RNA- 
protein complex formed in vitro . 
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The invention also relates to a method of screening for a compound that modulates the 
fomiation of an RNA-protein complex formed in vivo comprising, expressing in a eukaryotic 
cell an RNA fusion molecule of tiie current invention in the presence of a test compound, 
generating a whole cell extract, passing the extract over a first solid support comprising 
streptavidin protein, eluting a first eluate with the addition of biotin, collecting the first 
eluate, passing the first eluate over a second solid support comprising MS2 coat protein, 
eluting a second eluate with the addition or a reagent selected firom the group consisting of 
glutathione, RNAse or a denaturant, collecting the second eluate, wherein the second eluate 
contains the isolated RNA-protein complex, measuring th amount of isolated KNA-protein 
complex present, and comparing the amoxmt of isolated RNA-protein complex present in the 
absence of the compound to be tested. 

The invention also provides for a method of screening for a compovmd that modulates the 
formation of an KNA-protein complex formed in vitro comprising, (a) expressing an RNA 
fusion molecule of the cinrent invention in vitro, (b) obtaining a whole cell extract, (c) 
passing the whole cell extract ovct a first solid support comprising streptavidin protein, (d) 
eluting a first eluate with the addition of biotin, (e) collecting the first eluate, (f) passing the 
first eluate over a second solid support comprising MS2 coat protein, (g) eluting a second 
eluate with the addition of a reagent selected firom the group consisting of glutathione, 
RNAse.or a denaturant, (h) collecting the second eluate, wherein the second eluate contains 
the isolated RNA-protein complex, (i) measuring the amount of isolated RNA-protein 
complex present; and (j)comparing the amount of isolated RNA-protein complex present in 
the absence of the compound to be tested. In one embodiment, steps (c) to (e) are repeated. 
The invention also relates to the compounds or proteins that modulate the RNA-protein 
. complexes and that are identified by the screening methods of the current invention. 
The invention also provides for kits for detecting an RNA-protein complex comprising the 
RNA fusion molecules, the isolated DNA constructs and the vectors of the present invention. 
Detailed Descriptioii of the Drawings 

Preferred embodiments of the invention will be described in relation to the drawings in 
which: 

Figure 1. Tandem RNA affinity purification. A) RNAs of interest are tagged at their 5' or 3' 
end with two different RNA tags. The tagged RNAs are then expressed either in vitro or in 
vivo and tested for iunction. B) Functional complexes containing the tagged RNA are 
purified fix)m extracts using two afiSnity resins, each of which is capable of binding one of 
the tags. An important aspect of the tags, particularly the first tag used, is that it must be 
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capable of being dissociated from its affinity resin using conditions that do not disrupt the 
RNA-protein complex. Proteins eluted from the second resin are generally sufficiently pure 
for identification by SDS PAGE, silvCT staining, and Mass Spectrometry. Bound RNAs can 
also be identified using RTPCR or microarray analysis. 

C) Sequence of the TRAP cassette. Sequences in parentheses indicate each of the different 
functional motifs within the TRAP cassette. 
Figure 2. TRAP-tag purification using in vitro transcribed RNA. 
A) In vitro purification of proteins from extracts. Embryonic cytoplasnodc extracts were 
mixed with TRAP-tagged RNA or untagged control RNA and purified using TRAP. Eluates 
were subjected to SDS PAGE and silver staining. Lane 1 : no RNA added to the extract. Lane 
2: No bait RNA fused to the TRAP RNA. Lane 3: purification using TRAP RNA fused to a 
localization element from the 3'UTR of the Drosophila wingless gene mRNA (WLEl). . 
Lane 4: protein purification using TRAP RNA fused to a second transcript localizing element 
in the wingless mRNA 3 ' UTR (WLE2). Note that the RNAs containing the two baits 
(WLEl atnd WLE2) bind proteins that are not bound by the resins or TRAP RNA alone. B) In 
vitro purification of Bic-D from embryo extracts. Following the purification as described 
above, eluted proteins were subjected to SDS PAGE and then transferred to membranes for 
Western blotting with anti Bic-D antiserum. Lanes 2<4 are as described above. Note that the 
Bic-D signal is highly enriched in lanes 3 and 4 after IRAP purification with the WLEl and 
WLE2 localization elements. Bic-D was detected in the cmde extract (Lane 1) after much 
longer exposures (not shown). 

Figure 3. Localization of TRAP-tagged WLE ^(^Asm Drosophila embryos. 
To ensure that the TRAP-tag does not interfere with bait RNA function, WLE localization 
elements fused to TRAP RNAs were tested for localization activity in embryos. A) 
Fluorescently labeled untaggedWLE2 RNA (red) moves to the apical cytoplasm above the 
nuclei (green) after injection into a syncitial blastoderm staged embryo. B) A mutated WLE2 
element has no localizing activty. Labeled RNA remains below the nuclei. C) TRAP-tagged 
WLE2 RNA moves apically in the same manner as imtagged WLE2 RNA, indicating that the 
addition of the TRAP sequence has no obvious effect on localization function. 
Figure 4. In vivo TRAP 

A) TRAP purification using extracts in which TRAP-tagged WLE RNAs were expressed in 
vivo. Lane 1 : TRAP RNA with no bait; Lane 2: TRAP RNA containing a large portion of the 
wg 3'UTR that encompasses WLE2; Lane 3: TRAP-tag fused to a tandem duplication of 
WLE2; Lane 4: TRAP-tagged WLE2, Lane 5: TRAP-tagged WLEL 
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B) Western blot of TRAP purified proteins with anti- Bic-D antibody. Proteins loaded in each 
lane were purified using the TRAP constructs listed above. Fractions fi*om the load, 
streptavidin column eluate and MS2 column elute are as indicated below. 

C) Quantitation of Bic-D signals. BCL-generated band intensities were measured using a 
phosphoimager. Values shown are relative to background. 

Figure 5. Tandem RNA affinity purification. 

A) TRAP cassette DNA sequence. MS2 and SI motifs (indicated) are flanked by insulator 
sequences and restriction sites that facilitate the shufQing of motifs and insertion into various 
vectors. 

B) Schematic map of the 2XS1 and 2XMS2 cassettes introduced into in vitro (top) or in vivo 
(bottom) expression vectors. RNAs of interest can be tagged at their 5' or 3* end with two 
different RNA tags. Tagging at 5' end is shown here. 

C) Overview of the TRAP purification procedure. For the second affinity colxmm, elution can 
be achieved using RNAse (radicated), high salt, glutathione or denaturants. Alternatively, if 
RNA components are being identified, proteases can be used. 

Table 1. Suitability of tags for TRAP-tag purification. Tags used for affinity purification 
are shown in the left hand column. Sizes, affinity matrices, eluting reagents, and performance 
are shown in the columns to the right. Binding and elution efficiencies wore determined using 
^^P-labeled RNAs expressed in vitro and are expressed as percentage of label loaded. 
Detailed Descriptton 

The present invention will now be described more fully with reference to the accompanying 
drawings, in which preferred embodiments of the invention are shown. This invention may, 
however, be embodied in different forms and should not be construed as limited to the 
embodiments set forth herein. Rather, these embodiments are provided so that this disclosmre 
will be thorough and complete, and will fully convey the scope of the invention to those 
skilled in the art. 

The term **bait sequence" as used herein, is a cDNA or DNA sequence ttiat encodes a target 
RNA sequence. Examples of suitable bait sequences include RNAs, such as, the HIV Tat- 
binding tar element, the E. coli N protein binding box B element, and various recognition 
elements within RNA splice sites. 

The term ''isolated DNA sequence" as described herein includes DNA whether single or 
double stranded. The sequence is isolated and/or purified (i.e. from its natural environment), 
in substantially pure or homogeneous form, firee or substantially firee of nucleic acid or genes 
of the species of interest or origin other than the promoter or promoter firagment sequence. 
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The DNA sequence according to the present invention may he wholly or partially synthetic. 
The term "isolated" encompasses all these possibilities. 

The term "operably linked" as described herein means joined as part of the same nucleic acid 
molecule, suitably positioned and oriented for transcription to be initiated from the promoter. 
The terai '^promoter'' as described herein refers to a sequence of nucleotides from which 
transcription may be initiated of DNA operably linked downstream (i.e. in the 3' direction on 
the sense strand of double-stranded DNA). The promoter or promoter fragment may comprise 
one or more sequence motifs or elements conferring developmental and/or tissue-specific 
regulatory control of expression. For example, the promoter or promoter fragment may 
comprise a neural or gut-specific regulatory control element. 

The term *T)NA tag" as used herein refers to short DNA or cDNA sequences that encode a 
binding partner for a Ugand. The ligand may be any molecule that specifically binds to the 
binding.partner such as, antibiotics, antibodies or specific proteins. The DNA tags of the 
current invention may be located 3' or 5' to the bait sequence. DNA tags encode RNA tags. 
The terai 'TINA tags" as used herein refers to short RNA sequences that fimction as a binding 
partner for a ligand. The KNA tags must be short, fiilly modular and must not interfere with 
each other or with the target RNA sequence. At least one of the RNA tags must interact with 
its binding partner in a reversible fashion. 

The temi "transcription cassette" as used herein refers to a nucleic acid sequence encoding a 
nucleic acid that is transcribed. Cassettes described herein contain multiple components such 
as tags, insulators and suitable restriction sites. To facilitate transcription, nucleic acid 
elements such as promoters, enhancers, transcriptional termination sequences and 
polyadenylation sequences are typically included in ttie transcription cassette. 
The temi "cellular extract" as used herein refers to proteins isolated lysated cells; for 
example, nuclear, cytoplasmic or organelle extracts or fractions thereof or a mixture of 
purified or recombinant proteins; or a combination thereof 

The term "SI" as used herein refers to the streptavidin binding sequence as DNA [SEQ ID 

NO:l or SEQ ID NO:2] or RNA [SEQ ID NO: 3] 

« . 

The term "2xSl" as used herein refers to the streptavidin binding sequence as DNA [SEQ ID 
NO: 17] 

The term **MS2" as used herein refCTs to MS2 coat protein binding sequence as DNA [SEQ 
ID NO: 4] or RNA [SEQ ID NO:5]. 

The term "2xMS2" as used herein refers to two MS2 coat protein binding sequences as DNA 
[SEQ ID NO:6 and SEQ ID NO:7 and SEQ ID NO 18] or RNA [SEQ ID N0:8] 
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For more detailed reference of the sequences and what they are composed of: 

SEQ ID NO:l - SI DNA sequence including insulators with BgUI ends 

SEQ ID NO: 2 - SI DNA sequence gaccgaccagaatcatgcaagtgcgtaagatagtcgcgggccggg 

Bgin cloning site + spacers = 5' ATCGATAAAAA and 3' AAAAAATCGAT 

SEQ ID N0:3 - SI RNA sequence 

SEQ ID NO:4 - MS2 DNA sequence 

SEQ ID N0:5 - MS2 RNA sequence 

SEQ ID NO:6 - 2x MS2 DNA sequence including insulators with SacII ends 
SEQ ID NO:7 - 2X MS2 DNA sequence 
SEQ ID NO:8 - 2x MS2 RNA sequence 

SEQ ID NO:9 - Streptotag (streptomycin binding) tag DNA sequence with insulators and 
Kpnl 

SEQ ID NO: 10 - Streptotag (streptomycin binding) tag DNA sequence 
SEQ ID NO: 11 - Streptotag RNA sequence 

SEQ ID NO: 12 - Nut (N binding) DNA sequence with insxilators and Kpnl ends 
SEQ ID NO:13 - Nut (N binding) DNA seqeunce 

SEQ ID NO:14 - Nut (N binding) RNA sequence. This is the RNA produced by SEQ NO 12. 
SEQ ID NO:15 - D8 (Sephadex binding) DNA sequence 
SEQ ID NO: 16 - D8 RNA sequence 

SEQ ID NO :17 - TRAPSl DNA - SI tags wifli BgUI, Cla I restriction sites and spacers. 
SEQ ID NO: 18- TRAPMS2- MS2 tags with Sea I restriction sites and spacers. 
SEQ ID NO: 19- TAR DNA sequence 

The terminology used in flie description of the invention herein is. for the purpose of 
describing particular embodiments only, and is not intended to be Umiting to the invention. 
As used in the description of the invention and the appended claims, the singular forms "a", 
"an" and "the" are intended to include the plural forms as well, unless the context clearly 
indicates otherwise. Unless otherwise defined, all technical and scientific terms used herein 
have the same meaning as commonly understood by one of ordinary skill in the art to which 
this invention belongs. All publications, patent applications, patents and other references 
mentioned herein are incorporated by reference in their entirety. 
The preset invention relates to a method for isolating specific RNA-protein complexes 
formed in vivo. However, it can also be used to isolate or verify complexes formed in vitro. 
In vivo complex formation and purification is accomplished by expressing tagged versions of 
the RNA of interest in vivo and then using the tag to isolate associated fimctional RNP 
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complexes. Tags in the form of short RNA sequences that interact with specific proteins, 
antibiotics or synthetic ligands can be readily inserted 5' or 3' to the RNA of interest (see Fig 
lA). Although a number of these potential RNA tags exist, purification with these tags gives 
at most a thousand-fold purification of the associated RNAs. By using two RNA tags, the 
TRAP-tag method of the current invention provides approximately a million-fold purification 
of associated RNAs, which is sufficient for the identification of most cellular proteins. The 
tags in the current invention must be relatively short, fially modular, and must not interfere 
with each other or with the RNA of interest. Jn addition, at least one of the tags must interact 
with its ligand in a reversible fashion so that RNP complexes can be eluted intact firom the 
first ligand matrix and bound to the second matrix (see Fig. IB). When expressed in vivo, 
TRAP-tagged RNAs assemble into fimctional complexes, and these complexes are readily 
purified to homogeneity. 
Nucleic Acid Molecules 

Functionally equivalent nucleic acid molecule or polypeptide sequence 
The tenn "isolated DNA sequence" refers to a DNA sequence the structure of which is not 
identical to that of any naturally occurring DNA sequence or to that of any fragment of a 
naturally occurring DNA sequence spanning more than three separate genes. The term 
therefore covers, for example, (a) DNAwhichhas the sequence of part of a naturally 
occurring genomic DNA molecule; (b) a DNA sequence incorporated into a vector or into 
the genomic DNA of a prokaryote or eukaryote, respectively, in a manner such that the 
resulting molecule is not identical to any naturally occurring vector or genonaic DNA; (c) a 
separate molecule such as a cDNA, a genomic fragment, a fragment produced by reverse 
transcription of polyA RNA which can be amplified by PGR, or a restriction fragment; and 
(d) a recombinant DNA sequence th^t is part of a hybrid gene, i.e., a gene encoding a fiision 
, protein. Specifically excluded from this definition are nucleic acids present in mixtures of (i) . 
DNA molecules, (ii) transfected cells, and (iii) cell clones, e.g., as these occur in a DNA 
Ubrary such as a cDNA or genomic DNA library. 

Modifications in the DNA sequence, which result in production of a chemically equivalent or 
chemically similar amino acid sequence, are included within the scope of the invention. 
Modifications include substitution, insertion or deletion of nucleotides or altering the relative 
positions or order of nucleotides. 
Sequence identity 

The invention includes modified nucleic acid molecules with a sequence identity at least 
about: >95% to the DNA sequences provided in SEQ ID NO: 1, SEQ ID NO 2, SEQ ID NO 
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4, SEQ ID NO 6, SEQ ID NO 7. SEQ ID NO 9, SEQ ID NO 10, SEQ ID NO 12, SEQ ID NO 
13, SEQ ID NO 15, SEQ ID NO 17, SEQ ID NO 18, SEQ ID NO 19 (or a partial sequence 
thereof or their complementary sequence). Preferably about 1, 2, 3, 4, 5, 6, to 10, 10 to 25, 
26 to 50 or 51 to 100, or 101 to 250 nucleotides are modified. Sequence identity is most 
preferably assessed by the algorithm of the BLAST version 2.1 program advanced search 
(parameters as above). Blast is a series of programs that are available online at 
http//www.ncbi.nlm.nih.gov/BLAST. 
References to BLAST searches are: 

Altschul, S.F., Gish, W., MiUer, W., Myers, E.W. & Upman, D.J. (1990) "Basic local 
aligmnent search tool." J. Mol. Biol. 215:403_410. 

Gish, W. & States, D.J. (1993) "Identification of protein coding regions by database 
similarity search." Nature Genet. 3:266_272. 

Madden, T.L., Tatusov, R.L. & Zhang, J. (1996) "AppUcations of network BLAST server" 
Meth. Enzymol. 266:131_141. 

Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W. & Lipman, 
DJ. (1997) "Gapped BLAST and PSI_BLAST: a new generation of protein database search 
programs." Nucleic Acids Res. 25:3389_3402. 

Zhang, J. & Madden, T.L. (1997) "PowerBLAST: A new network BLAST ^Ucation for 
interactive or automated sequence analysis and annotadoih" Genome Res. 7:649_656. 
Other programs are also available to calculate sequence identity, such as Clustal W program 
(preferably using default parameters; Thompson, ID et al.. Nucleic Acid Res. 22:4673-4680). 
DNA sequences functionally equivalent to the SI SEQ ID NO: 2, or MS2 SBQ TD NO: 4 can 
occur in a variety of forms as described above. 

The sequences of the invention can be prqpared according to numerous techniques. The 
invention is not limited to any particular preparation means. For example, the nucleic acid 
molecules of the invention can be produced by cDNA cloning, gaiomic cloning, cDNA 
synthesis, polymerase chain reaction (PGR) or a combination of these approaches (Current 
Protocols in Molecular Biology, F.M. Ausbel et al., 1989). Sequences may be synthesized 
using well-known methods and equipment, such as automated synthesize. 
Hybridization 

Other functional equivalent forms of the SI SEQ ID NO: 1 and SEQ ID NO: 2 and MS2 
DNA SEQ ID NO: 4 molecules can be isolated using conventional DNA-DNA or DNA-RNA 
hybridization techniques. These nucleic acid molecules and the SI SEQ ID NO: 1, SEQ ID 
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NO: 2, SEQ ID NO 17 and MS2 SEQ ID NO 4, SEQ ID NO 6, SEQ ID NO 18 sequences can 
be modified without significantly affecting their activity. 

The present invention also includes nucleic acid niiolecules that hybridize to one or more of 
the DNA sequences provided SI SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 17 and MS2 
SEQ ID NO: 4, SEQ ID NO: 6, SEQ ID NO: 1 8 (or a partial sequence thereof or their 
complementary sequence). Such nucleic acid molecules preferably hybridize to all or a 
portion of SI SEQ ID SEQ ID NO: 1, SEQ ID NO: 2, SEQ ID NO: 17 or MS2 SEQ ID NO: 
4, SEQ ID NO: 6, SEQ ID NO: 18 or their complement under low, moderate (intermediate), 
or high stringency conditions as defined herein (see Sambrook et al. (most recent edition) 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, N.Y.; Ausubel et al. (eds.), 1995, Current Protocols in Molecular Biology, 
(John Wiley & Sons, NY)). The portion of the hybridizing nucleic acids is typically at least 
15 (e.g. 20, 25, 30 or 50) nucleotides in length. The hybridizing portion of the hybridizing 
nucleic acid is at least 80% e.g. at least 95% or at least 98% identical to the sequence or a 
portion or all of a nucleic acid encoding SI or S2 or their complement. Hybridizing nucleic 
acids of the type described herein can be used, for example, as a cloning probe, a primer (e.g. - 
a PCR primer) or a diagnostic probe. Hybridization of the oligonucleotide probe to a nucleic 
acid sample typically is performed und^ stringent conditions. Nucleic acid duplex or hybrid 
stability is expressed as the melting temperature or Tm, which is the temperature at which a 
probe dissociates fi:om a target DNA. This melting temperature is used to define the required 
stringency conditions. If sequences are to be identified that are related and substantially 
identical to the probe, rather than identical, tihien it is usefiil to first establish the lowest 
temperature at which only homologous hybridization occurs with a particular concentration 
of salt (e.g. SSC or SSPE). Then, assuming that 1% mismatching results in a 1 degree 
Celsius decrease in the Tm, the temperature of the final wash in the hybridization reaction is 
reduced accordingly (for example, if sequences haviug greater than 95% identity with the 
probe are sought, the final wash temperature is decreased by 5 degrees Celsius). In practice, 
the change in Tm can be between 0.5 degrees Celsius and 1.5 degrees Celsius per 1% 
mismatch. Low stringency conditions involve hybridizing at about: IXSSC, 0.1% SDS at 
50°C. High stringency conditions are: O.IXSSC, 0.1% SDS at 65^C. Moderate stringency is 
about IX SSC 0.1% SDS at 60 degrees Celsius. The parameters of salt concentration and 
temperature can be varied to achieve the optimal level of identity between flie probe and the 
target nucleic acid. 



13 



wo 2004/033718 PCT/CA2003/001555 

The present invention also includes nucleic acid molecules from any source, whether 
modified or not, that hybridize to genomic DNA, cDNA, or synthetic DNA molecules that 
encode. A nucleic acid molecule described above is considered to be functionally equivalent 
to a SI nucleic acid molecule SEQ ID NO: 1, SEQ ID NO 2, SEQ ID NO 17 of the present 
invention if the sequence encoded by the nucleic acid molecule is recognized in a specific 
manner by streptavidin and is elutable by biotin. A nucleic acid molecule described above is 
considered to be functionally equivalent to a MS2 SEQ ID 4, SEQ ID NO 6, SEQ ID NO 7, 
SEQ ID NO 18 nucleic acid molecule of the present invention if the sequence encoded by the 
nucleic acid molecule is recognized in a specific manner by flie MS2 coat binding protein. 
Vectors 

The present invention provides an expression vector comprising a transcription cassette. The 
transcription cassette can be cloned into a variety of vectors by means that are well known in 
the art. Such a vector may comprise a suitably positioned restriction site or other means for 
insertion of a transcription cassette. The vector may also contain a selectable marker. For 
use in an assay or experiment, commercially available vectors such as a CMV Casper 
promoter vector may be employed. For use in gene therapy, vectors such as adenovirus may 
be employed. Cell cultures transfected or transformed with the DNA sequences of the 
current invration are useful as research tools particularly for studies of RNA-protein 
complexes. One skilled in the art will appreciate that there are a wide variety of suitable 
vectors. 
Host Cells 

A fiirther aspect of the present invention provides a host cell containing a transcription 
cassette of the current invention. Examples of particularly desirable host cells include yeast, 
ES, PI 9, COS, S2 and SF9 cells. Methods known in the art for transformation, include but 
are not limited to electroporation, rubidium chloride, calcium chloride, calcium phosphate or 
chloroquine transfection, viral infection, phage transduction, microinjection, and the use of 
cationic Upid and lipid/amino acid complexes or of Uposomes, or a large variety of other 
commercially available and readily synthesized transfection adjuvants, are useful to transfer 
tiie vectors of the current invention into host cells. Host cells are cultured in conventional 
nutrient media. The media may be modified as appropriate for inducing promoters, 
amplifying nucleic acid sequences of interest or selecting transformants. The culture 
conditions, such as temperature, composition and pH will be apparent After transformation, 
transformants may be identified on the basis of a selectable phenotype. 
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RNA fusion molecules 

The current inveintion provides for RNA fusion molecules comprising RNA tags, insulator 
elements and target RNA sequences. The RNA fusion molecule contains at least two 
dififerent RNA tags. Suitable RNA tags include, but are not Umited to streptavidin binding 
sequence, an MS2 coat protein binding sequence, a strieptomycin binding sequence 
(Streptotag), a sephadex binding sequence, an N protein binding sequence, a REV binding 
sequence, a TAT-binding sequence and an R17 coat protem binding sequence. In some 
embodiments of the invention, it will be suitable to have more than one copy of an RNA tag. 
For example, it may be desirable to have 2xMS2 coat protein binding sequence and 2X SI 
binding sequence (see Fig 5). In another embodiment, it may desirable to have from 3X to 
6X MS2 coat protein binding sequences and from 3X to 6X SI protein binding sequences. In 
general, increasing the number of RNA tags in the RNA fusion molecule increases the degree 
of purification of the resulting RNA-protein complex due to an increase in the affinity of the 
RNA-protein complex for the affinity resin. 

A target RNA sequence may be an oligoribonucleotide sequence or a ribonucleic acid 
sequence. Generally, for use in this invention, the target RNA sequence is RNA, including 
ribosomal RNA, RNA encoded by a gene, messenger RNA, UTRii, ribozyme RNA, catalytic 
RNA* small nuclear RNA, small nucleolar RNA, etc., from a microorganism, or an RNA 
expressed by a cell infected with a virus, or RNA from a host cell, or RNA encoded by a 
genomic sequence; or RNA encoded by a chemically synthesized DNA sequence or randord 
RNA encoded by randomly isolated DNA. 

Insulator elements may be placed on either side of the RNA tags and function to ensure 
proper folding of the RNA tags and to discourage mteractions between the tags and the target 
RNA sequence. Examples of suitable insulator elements include, but are not limited to 
stretches of 4-5 identical nucleotides (eg, adenosines) coupled with paired restriction sites 
that do not interact with the tag or bait sequences. The 5' and 3' restriction sites should be 
identical as these sequences can then hybridize, forming a stem that forces the ^'insulator" 
polynucleotide sequences to be "unpaired" thus isolating the intemal tag or bait structures 
from tiie remainder of the RNA sequences produced, from a specific vector. Insulator 
elements may also be called spacers. 
Method of Purifying 

The invention provides a metiiod for purifying an RNA-protein complex formed in vitro or in 
vivo. The isolated protein part of the RNA-protein complex may then be identified by 
various methods and techniques including but not limited to SDS-page, silver staining, 
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Western blotting and mass spectrometry. Examples of suitable solid supports for use with the 
different embodiments of the current mvention include affinity columns comprising bound 
streptavidin or bound MS2, wherein the MS2 can be bound to agarose or sepharose beads. 
MS2 affinity columns can also be made by ciosslinking to resins such as affigel beads, or 
binding as a fusion protein to an appropriate resin (eg GST-MS2 to glutathione beads). 
Method of Screening 

The current invaition relates to a method of screening for a compound that modulates or 
regulates the formation of an KNA-protein complex formed in vivo or in vitro. Other 
methods, as well as variation of the above methods will be apparent &om the description of 
this invention. For example, the test compound may be either fixed or inareased, a plurality 
of compounds or proteins may be tested at a single time. "Modulation", ''modulates", and 
"modulating" can refer to enhanced formation of the RNA-protein complex, a decrease in 
formation of the RNA-protein complex, a change in the type or kind of the KNA-protein 
complex or a complete inhibition of formation of the RNA-protein complex. Suitable 
compounds that may be used include but are not limited to proteins, nucleic acids, small 
molecules, hormones, antibodies, peptides, antigens, cytokines, growth fectors, 
pharmacological agents including chemotherapeutics, carcinogenics, or other cells (i.e. cell- 
cell contacts). Screening assays can also be used to map binding sites on RNA or protein. 
For example, tag sequences encoding for RNA tags can be mutated (deletions, substitutions, 
additions) and then used in screening assays to determine the consequences of the mutations. 
Kits 

The invention includes kits for detecting RNA-protem complexes comprising at least one 
isolated DNA construct of the invention or at least one vector of the current invention. 
Tandetn RNA purification 

A number of RNA motifs suitable as RNA affinity tags exist. We first tested five of these for 
potential use in our double-tagging system. These include the "streptotag", a streptomycin 
binding ^tamer (Bachler et al., 1999), "Sl",a streptavidin binding aptamer (Srisawat and 
EngeDce, 2001), *'D1", a sephadex binding aptamer (Srisawat et al., 2001), the MS2 phage 
coat protein binding RNA (Jurica et al., 2002), 'TAR", a Tat protein binding sequence 
(Puglisi et al., 1995) and the lambda phage box B RNA Q:.azinski et al., 1989). Table 1 
shows tiie relative binding and elution efficiencies of each ^^P -labeled tag and its ligand. 
Two of the five tags, the streptavidin (81, SEQ ID NO: 1 and SEQ ID NO: 2) and MS2 coat 
protein (MS2) tags, were found to bind and elute efficiently under the desired purification 
conditions. Importantly, neither tag cross-reacted with any of the other tested ligands. 
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Greater than 95% of the SI tag SEQ ID NO: 1 and SEQ ID NO: 2 bound to streptavidin 
agarose beads, of which 95% could be recovered with the addition of biotin. Approximately 
75% of flxe loaded MS2 tag bound to GST-coat protein- beads, and approximately 70% of the 
loaded tag could be eluted with glutathione. 
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Table 1 - RNA aptamer tags tested for use in TRAP vectors. 



RNA 


SEQ ID NO 


Length 


AfBnity target 


Eluted mth: 


% Bound 


% Elu 


aptamer 




(nucleotides) 










Streptotag 


9-DNA 


64 


8-hydroxy- 


Streptomycin 


21% ±2% 


12% ± 


U-RNA 




streptomycin 








MS2, 


4,6-DNA 


38, 96 


Coat BiiiQing 


Keducea 


/ Jyo ± J /o 


OO /O 3C 


2xMS2 


S.B-KNA 




Protein 


Glutathione 






CI 


1 FlMA 
1- UiNA 

3-RNA 


oo 


CSfrpnfjiviHiTi 

l3 U w Lr IcX V XVXIJLL 




>99% 


94% ± 


D8 


15 -DNA 
16 -RNA 


64 


Sephadex 


n/a 


34% ±1% 


21% ± 


TAR 


19-DNA 


29 


Tat Protein 


Tat Peptide 


80% ±5% 


NE 


Nut 1-39 


12- DNA 
14 -RNA 


33 


N-protein 1-22 


n/a 


<1% 


<1"/ 



Next the ability of the Streptavidin and MS2 coat protein tags to function together and in the 
presence of an KNA target molecule was tested. Cassettes containing a T7 promoter, the two 
RNA tags, altemative target RNA insertion sites and a poly A tail were made (Figure IB). 
Insulator elements, consisting of 8-10 Adenosines flanked by identical restriction sites, were 
placed on either side of each tag to ensure proper folding of the tags and to discourage 
interactions between the tags and the inserted target RNA. ^^P-labeled RNAs were first tested 
for retention and elution on streptavidin and GST-coat protein colunms. Both tags worked 
with much the same efficiency as when used individually. A construct containing 2XS1 tag 
SEQ ID NO: 17 and 2XMS2 tags SEQ E) NO: 18 are preferred. 
TRAP tag purification using in vitro transcribed RNA 

Next the constructs were tested for the abiUty to purify specific RNA binding proteins 
firom a complex protein mixture. Two, approximately 100-nucleotide long elements from the 
Drosophila wingless gene mRNA (WLEl and WLE2) were chosen for this purpose. These 
elements are required for the asyimnetrical localization of wingless transcripts to apical 
cytoplasm (Simmonds et al., 2001). The two elements show no similarity in sequence or 
predicted secondary structure and exhibit marked differences in their ability to localize 
transcripts. On the other hand, both appear to mediate localization via dynein-dependent 
microtubule transport (Wilkie and Davis, 2001). Hence, they probably interact with unique- 
but overlsqpping subsets of proteins. 

The tagged RNAs were expressed in vitro, and the cold RNA mixed for 30 minutes with 
Drosophila embryo extracts prior to purification over the two colunans. Figure 2A shows that 
each of the tagged localization elements did indeed associate with different subsets of 
proteins that were not bpund by beads or tags alone. Nine (9) of nineteen (1 9) proteins 
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identified by Mass spectrometry are known or predicted RNA binding proteins (Sitnmonds 
and Krause, in preparation). Figure 2B shows that one of these proteins is Bic-D, a protein 
jyreviously implicated as being required for apical mRNA transport in blastoderm stage 
Drosophila embryos (Bullock and Ish-Horowicz, 2001). 
Localization ofTRAP-tagged WLE UNAs in Drosophila embryos 
The fibaal test was to ensure that complexes formed on the tagged RNAs in vivo are both 
active and readily purified. To confirm this, tagged WUE constructs were first fluorescently 
labeled and injected into syncitial blastoderm stage embryos. RNAs with an apical 
localization motif will move fix)m the site of injection upwards, between the syncitial nuclei 
to the apical surface (Bullock and Ish-Horowicz, 2001). Figure 3A shows untagged WLE2 
RNA after localization to the apical surface. Figure 3C shows that TElAP-tagged WLE2 RNA 
localizes to the apical surface in an indistinguishable fashion. Thus, the tags appear to have 
no effect on the function of the localizing element. TRAP-tagged wingless localization 
elements expressed in transgenic embryos also localized apically (data not shown). Extracts 
were made fix)m these transgenic fly lines and used for purification of WLE-associated 
proteins. 

TRAP tag purification using RNA expressed in vivo. 

Figure 4 shows that, as in vitro, each of the tagged WLE constructs binds a different subset of 
proteins. The identities of some of these proteins were determined by Mass Spectrometry. 
Once again, one of the purified protdns included Bic-D. 

Note that, although the proteins identified here were easily detected using a small amount of 
extract and silver staining, the reversibility of tte two columns permits the optional use of a 
second round of purification to detect proteins of very low abundance and proteins that do not 
bind the bait stoichiometrically or m all cell types. The SI tag SEQ ID NO:l SEQ ID NO: 2 
is particularly well suited for repeated rounds of purification. It provides high degrees of 
purification with little loss of material, and the biotin used for elution is easily removed. 
Biotin removal is achieved by running the eluate over an avidin colvmm (the SI tag SEQ ID 
NO: 1 SEQ ID NO: 2 does not bind avidin). The flow-through is then boimd to the second 
streptavidin column and eluted with biotin as before. This approach can also be used for prior 
removal of streptavidin binding proteins, should they be present in extracts in large amounts. 
Clearly, this j^proach is s^licable to any cell or tissue type. The TRAP cassette is simply 
placed into an impropriate vector. Although the in vivo application of flie method is flie most 
■ powerful version of this ^roach, in vitro assays are also clearly applicable. For example, 
using mutagenesis, the importance of specific nucleotides and structural aspects of known or 
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newly discovered interactions can be rapidly tested with in vitro expressed KNAs and then 

confirmed in vivo. This approach is also amenable to high throughput analyses. This is 

particularly true for in vitro work with extracts, and with transfected or virally infected cells. 

With a little more effort, the approach can also be applied to transformed cells and transgenic 

tissues. For example, as has been done for proteins in yeast, TRAP tags could be placed 

within each yeast gene and substituted for the endogenous gene by homologous 

recombination. However, this approach is probably the most useful for small RNAs and 

functionally characterized RNA motifs. It is also possible to identify other KNAs bound 

within TRAP-purified complexes. This can be achieved either by RTPCR, or more globally 

by labeling the RNAs and hybridizing to microarrays. / 

Given the rapidly growing number of important processes controlled by RNAs and the 

proteins that bind them, TRAP-tagging should prove to be a key tool in the elucidation of 

these functions on a genomic scale. Once well characterized, functional RNA elements can 

serve as drug targets (RNAi etc). Viral RNAs such as HIV, hepatitis B, and tiie protems that 

bind them, are particularly applicable targets. Examples of such uses include the treatment of 

viral infections, the control of cellular proliferation and the stimulation of neuronal 

regeneration. 

Vector construction 

Initial TRAP vectors were constructed using a cassette-based ^roach to allow for maximum 
versatility. Cassettes were made using paired oUgonucleotides cloned into pSP72 (Promega). 
To facilitate further cloning into other expression vectors the plasmid was modified by 
addition of an Spel restriction site 3' to the polyUnker Xhol site using the paired 
olgionucleotides S'Spel TCGAGACTAGT and VSpel AGCTTGATCAG. 
The streptavidin aptamer was added by hybridization of S IBglllS ' 

(ATCTAAAAGACCGACCAGAATCATGCAAGTGCGTAAGATAGTCGCGGGCCGGGAAAAAA and 
SlBgHB' 

(ATCnTrriTCCCGGCCCGCGACTATCITACGCAC^ 

oligonucleotides and insertion into the Bgin site of pSP72.(see Fig 5) 

The MS2 ^tamer was created by hybridizing the oligos 

MS2 5' (CAAACGACTCTAGAAAACATGAGGATCACCCATGTCTGCAGG) and 

MS2 3' (TCGACCTGCAGACATGGGTGATCCTCATGTITTCTAGAGTCGTTTTro and the 

oligos MS2 5' (TCGACrCTAGAAACATGAGGATCACCCATGTCTGCAGGTCAAAAAGAGC^ and 

MS2 3* (CTTTXTGACCTGCAGACATGGGTGATCCrCATGTTTTCTAGAG), subcloning the tWO 

fiagments separately into pBluescript SK" (Stratagene) and were then ligating the excised 
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fragments together with the pSP72 vector linearized with SacU. Clones were then sequenced 
to identify those with MS2 aptamer sequences in the correct orientation. Primers used to 
create other tags tested include 5'Streptotag Kpnl 

(CAAAAGGATCGCATTTGGACTTCTGCCCAGGGTGGCACCACGTGCGGATCCAAAAGGTAC), 
3'Streptotag Kpnl 

(CITITGGATCCX}ACCGTGGTGCX:ACCCTGGGCAGAAGTa:AAATGCGATCCI^ 
5 'Kpnl (GATCCnrrCGGGTGAAAAAGGGCTTTTG) and N3 'Kpnl 

(GATCCAAAAGCCcrmTCAGGGCAAAG). Plasmids produced by these manipulations are 
referred to respectively as pTRAPSl. pTRAPMS2, pTRAPSlMS2, pTRAPN, pTOAPSlN. 
The wingless 3'UTR regions referred to as WLEl (wg 3'UTR 1-181), WLE2 (wg 3'XJTR 
659-773), 2x WLE2 (tandem duplication of WLE2) , WLE2-mutated (WLE2 with residues 
678-689 mutated to the sequence AGATCT) and wg 3'UTR 360-1 107 were ampUfied by 
polymerase chain reaction (PGR) and cloned into the BamHl site of the pTRAPSlMS2 vector 
to create the vectors pTRAPSlMS2+WLEl, pTRAPSlMS2+WLE2 ; 
pTRAPSlMS2+2xWLE2 pTRAPSlMS2+WLE2(mutated) and ptRAPSlMS2+wg 3'UTR 
360-1 107 respectively. For constructs that could be expressed in transgenic flies, Hpal -Spel 
fragements of pTRAPSlMS2+WLEl, pTRAPSlMS2+WLE2, pTRAPSlMS2+wg 3'UTR 
360-1107, pTRAPSlMS2+2xWLE2, pTRAPSlMS2+WLE2(mutated) orpTRAPSlMS2 (no 
insert) were subcloned into BglR-Stul cut pCASPER-HS (Thummel and Pirrotta 1992). The 
resulting vectors, pCASPER-THAPWLEl, pCASPER-TRAPWLE2, pCASPER- 
TRAP2xWLE2, pCASPER-TRAPWLE2 (mutated) and pCASPER-TRAP, were introduced 
into DrosophUa embryos by P-element-mediated transfonnation (Spradling and Rubin 1982). 
A T"inimiim of tiu-ee independent transgenic lines was isolated for each construct injected. 
Production ofGST-MCP beads 

A coat protein GST fusion was made by subcloning a PGR fragment consisting of the entire 
open reading frame, with a BamHI site added 3' and an Xhol site added 5', into the vector 
pGEX4T-l (Pharmacia). The fusion protein was expressed in E. coli BL21 cells grown at 
3rC for 3 hours (ODeoo of 1 .8) and then induced withlOOmM IPTG for 4.5 hours. Gells were 
pelleted in 250 ml aliquots, quick frozen in liquid nitrogen and stored for as long as 2 months 
at -70*'C. Cell pellets were lysed by sonication (5 min at 50%) and bound to Glutathione- 
Sepharose beads (Pharmacia) as specified by the manufacturer. After extensive washing, the 
fusion protein was cross-linked to the beads using 20mM dimethyl pimelimidate dissolved in 
200mM HEPES (pH 8.5) buffer (Bar-Peled et al., 1996). The cross-linked affinity resin can 
be stored for at least 6 months at -20'*C in storage buffer (HEPES pH 7.4, 80 mM NaCl, ImM 
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EDTA, ImM DTT, 40% glycerol). Alternatively, if glutathione elution from the coat protein 
beads is desired, the protein can be left uncoupled. However, the eluted protein may then 
obscure the presence of other specifically bound proteins. 
In vitro RNA expression 

Templates for transcription were made by linearization of pTRAP constructs with Xhol, 
■ phenol/chlorofoim extraction to remove the enzyme and ethanol precipitation. 25^1 
transcription reactions contained 1 |ag linearized pTRAP DNA, 5^1 5x T7 RNA polymerase 
buffer (400mM Tris-HCl pH 8.0, 60mM MgClz), S^d lOmM NTP mix, 1^1 0.75mM DTT 
■ (ElNAse free), 20U placental RNAse inhibitor (MBI), 15U T7 RNA polymerase and RNAse 
free water to 25^1. Reactions were incubated at 37°C for 2 hours, the resulting RNA 
precipitated using 0.4M UCl and 2.5 volumes of ethanol and the pellets resuspended in 40^1 
RNAse free water (Ambion). The yield of RNA product is approximately 25^g, which is the 
amount of RNA added to 1ml of Drosophila cytoplasmic extract (described below). 
Extract preparation 

Drosophila embryos were collected for 4 or 12 hours and aged an additional 4 hours. TRAP 
constructs in transgraiic embryos were induced using a 30 min heat pulse (Se-S^C ). 
Cytoplasmic extracts were prepared essentially as described by Moritz (SuUivan et al., 2000) 
with the following changes. TRAP Purification Buffer (5x TPB stock solution = 300mM 
HEPES pH 7.4, 50mM MgGl, 400mM NaCl, 0.5% Triton X-100) was used for aU steps of 
.- extract production. TPB working solution was made by adding glycerol to 10%, proteinase 
inhibitor (Complete EDTA free; Roche) and DTT (ImM final) to diluted stock solution. 
Dechorionated embryos were washed twice in the dounce with 3 volumes of TPB buffer and 
then removing all but enough buffer to just cover the embryos. Homogenized extract was 
passed just once through miracloth. The resulting filtrate was spun at 14,000g for 10 minutes 
(in microfuge or appropriate centrifiige tubes, depending on volume) and transferred to new 
tubes. Centrifugation was repeated as necessary until the filtrate was clear. If not used right 
away, glycerol was added to 20% final, and the extract flash fix)zen and stored at -70C. 
TRAP purification 

RNAse free conditions and solutions made with DEPC treated water were used throughout. 
For in vitro purifications, thawed lysate was re-centrifuged for 5 min at 14,000g, and lOp-g 
RNA added per ml of lysate. After incubation for 2-3 hours at 4*'C, the lysate was mixed with 
streptavidin agarose beads (Sigma: 200^1 beads/ml extract) pre-equiUbrated IxTPB solution. 
After gentle rocking for Ih at 4*'C, the mixture was added to an RNAse-free chromatography 
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column and aUowed to settle. Columns were then un-plugged, the unbound material allowed 
to flow-through and then washed three times with 1ml TPB. Bound complexes were eluted 
by plugging the columns, adding SOOjil Biotin elution buffer, (Ix TPB + 5mM d-Biotin, 
Sigma), incubating for Ihr at 4"C and then opened and the eluate collected. An additional 
250nl Biotin elution buffer was added to the column and the eluates pooled. An option at this 
point is to repeat the streptavidin affinity chromatography after first removing the biotin 
(using Avidin-agarose beads). 

Stireptavidin eluates were then bound to GST-MCP beads. Approximately 50^1 of GST-CP 
sepharose beads, pre-washed 3 times in Ix TPB, was used per SOO^il of stireptavidin eluate.. 
After rocking for Ih at 4°C, the mixtiire was transferred to a plugged RNAse-free mini 
column. After the beads settled, tiie colunm was unplugged, the unbound material allowed to 
flow-through and the beads washed three times with 1ml IxTPB. Bound complexes were 
eluted using either glutathione elution buffer (Pharmacia), high salt (5xTPB), RNAse (200m1 
of 2mg/mlRNAseA + SOOOu/mlRNAse Tl (Fermentas) or various denatiirants (eg. urea, 
SDS). This was done by adding one bed volume of elution buffer, incubating for 30 min, 
eluting, rinsing three times with elution buffer and pooling the four eluates. Proteins were 
then resolved by SDS PAGE and identified by Trypsin proteolysis. Mass Spectrometry 
(Fenyo 1998) and submission of the data to Drosophila genomic databases (Adams 2000). 
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